Integrating ambiguously aligned regions of DNA sequences in phylogenetic analyses without violating positional homology.

Authors

  • F Lutzoni
  • P Wagner
  • V Reeb
  • S Zoller
Abstract

Phylogenetic analyses of non-protein-coding nucleotide sequences such as ribosomal RNA genes, internal transcribed spacers, and introns are often impeded by regions of the alignments that are ambiguously aligned. These regions are characterized by the presence of gaps and their uncertain positions, no matter which optimization criteria are used. This problem is particularly acute in large-scale phylogenetic studies and when aligning highly diverged sequences. Accommodating these regions, where positional homology is likely to be violated, in phylogenetic analyses has been dealt with very differently by molecular systematists and evolutionists, ranging from the total exclusion of these regions to the inclusion of every position regardless of ambiguity in the alignment. We present a new method that allows the inclusion of ambiguously aligned regions without violating homology. In this three-step procedure, first homologous regions of the alignment containing ambiguously aligned sequences are delimited. Second, each ambiguously aligned region is unequivocally coded as a new character, replacing its respective ambiguous region. Third, each of the coded characters is subjected to a specific step matrix to account for the differential number of changes (summing substitutions and indels) needed to transform one sequence to another. The optimal number of steps included in the step matrix is the one derived from the pairwise alignment with the greatest similarity and the least number of steps. In addition to potentially enhancing phylogenetic resolution and support, by integrating previously nonaccessible characters without violating positional homology, this new approach can improve branch length estimations when using parsimony.
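
The second and third steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that each distinct ungapped sequence in a delimited region becomes one character state, and that the cost between two states is the plain edit distance, with every substitution and every single-base indel counted as one step. The function names `edit_distance` and `code_region`, and the taxon labels in the usage note, are hypothetical.

```python
from itertools import combinations


def edit_distance(a: str, b: str) -> int:
    """Minimum number of substitutions plus single-base indels turning a into b.

    Assumption: unit cost for every change; other weightings are possible.
    """
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,                # delete ca
                curr[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),   # match (0) or substitution (1)
            ))
        prev = curr
    return prev[-1]


def code_region(region_by_taxon: dict[str, str]):
    """Code one ambiguous region as a single character plus a step matrix."""
    # Strip gap symbols so only the residues themselves are compared.
    ungapped = {t: s.replace("-", "") for t, s in region_by_taxon.items()}
    # Each distinct sequence becomes one character state (0, 1, 2, ...).
    variants = sorted(set(ungapped.values()))
    state_of = {seq: k for k, seq in enumerate(variants)}
    coded = {t: state_of[s] for t, s in ungapped.items()}
    # Symmetric step matrix: cost of transforming one state into another.
    n = len(variants)
    steps = [[0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        steps[i][j] = steps[j][i] = edit_distance(variants[i], variants[j])
    return coded, steps
```

For a made-up region such as `{"taxonA": "AC-GT", "taxonB": "ACGT", "taxonC": "A--GT", "taxonD": "ACCGTT"}`, `code_region` returns one state per taxon and a 3 x 3 matrix of step counts, which could then be written out as a user-defined step matrix for a parsimony program. Because edit distance is a metric, a matrix built this way satisfies the triangle inequality.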

Similar resources

Incorporating information from length-mutational events into phylogenetic analysis.

With the growing number of phylogenetic studies that use length variable DNA sequences, incorporating information from length-mutational events into phylogenetic analysis is becoming increasingly important. A new method, modified complex indel coding is described that aims at maximizing the phylogenetic information retained from unambiguously aligned sequence regions or regions where the princi...

Sequence Analysis and Phylogenetic Study of Hemagglutinin Gene of H9N2 Subtype of Avian Influenza Virus Isolated during 1998-2002 in Iran

Sequence analysis and phylogenetic study of hemagglutinin (HA) gene of H9N2 subtype of avian influenza virus isolates (outbreaks of 1998-2002) in Tehran province (Iran) were studied. Two sets of forward and reverse primers in highly conserved regions, based on sequences of HA gene in Genbank, were designed. PCR products of a 430-bp fragment of 16 isolates were sequenced and then were aligned wi...

Evolution and phylogenetic utility of alignment gaps within intron sequences of three nuclear genes in bumble bees (Bombus).

To test whether gaps resulting from sequence alignment contain phylogenetic signal concordant with those of base substitutions, we analyzed the occurrence of indel mutations upon a well-resolved, substitution-based tree for three nuclear genes in bumble bees (Bombus, Apidae: Bombini). The regions analyzed were exon and intron sequences of long-wavelength rhodopsin (LW Rh), arginine kinase (ArgK...

Morphology, molecules, and the phylogenetics of cetaceans.

Recent phylogenetic analyses of cetacean relationships based on DNA sequence data have challenged the traditional view that baleen whales (Mysticeti) and toothed whales (Odontoceti) are each monophyletic, arguing instead that baleen whales are the sister group of the odontocete family Physeteridae (sperm whales). We reexamined this issue in light of a morphological data set composed of 207 char...

The Phylogeny of Calligonum and Pteropyrum (Polygonaceae) Based on Nuclear Ribosomal DNA ITS and Chloroplast trnL-F Sequences

This study represents phylogenetic analyses of two woody polygonaceous genera Calligonum and Pteropyrum using both chloroplast fragment (trnL-F) and the nuclear ribosomal internal transcribed spacer (nrDNA ITS) sequence data. All inferred phylogenies using parsimony and Bayesian methods showed that Calligonum and Pteropyrum are both monophyletic and closely related taxa. They have no affinity w...

Journal:
  • Systematic Biology

Volume 49, Issue 4

Pages: -

Publication date: 2000